Entry Name:  " UBA-Arcaya-Mendoza-MC2 "

VAST Challenge 2014
Mini-Challenge 2

 

 

Team Members:

LUDMER EDWARD  ARCAYA , UBA, eduard704@gmail.com

VALERIA SILVANA MENDOZA  , UBA, valeriamendoza79@gmail.com

Student Team:  YES

 

Analytic Tools Used:

Tableau

Excel

Ibm spss

 

Approximately how many hours were spent working on this submission in total?

total number of hours worked is 80 hours.

 

May we post your submission in the Visual Analytics Benchmark Repository after VAST Challenge 2014 is complete?  YES

 

 

Video:

 

UBA-ARCAYA-MENDOZA-MC2.wmv

 

-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------

 

Questions

 

MC2.1Describe common daily routines for GAStech employees. What does a day in the life of a typical GAStech employee look like?  Please limit your response to no more than five images and 300 words.

https://lh4.googleusercontent.com/NoUB9v9gHcryEL7NU8Z7G_njabdPPmmPIOyfTy9X0Cgo_JcD3TlX7UlGu2XLDFjYgppJpEayIastLakxIra6iKPMmojbKQKGgmUupIj69FC3ZrJgJvffLs_2HUw8GozPEQ

Figure 1.most frequented places  in  abila .

 

 

https://lh4.googleusercontent.com/eyeEbJJxSSl8bsPTjlewYdEqaHbYV13LxigU-SndKdYWS1LAMgtn8BxbFPzzXfsv1Jh7B4JsJNnWVDK9LAw_Kw3hz5vb574fBDk_gzGbu1RFEMhuc5Xm7KyU1AsIIGhX9g

Figure 2. Locations where employees  remain during the day .

 

 

https://lh4.googleusercontent.com/EUxhr08VV5ZrGAa46Y0GS79JYRV-t-64sjq5PsJl9YWr04gE-pamDpspS5SqmkGuYesilS4ZG-_misFV7L_onZhZMn4CPOuaRZSIZBuPkOx3YgpDYCLNdTJ8Qwc-I99wUw

Figure 3. frequented places through day times.

 

 

https://lh6.googleusercontent.com/7_R2uAcEq6ahzEtE8A2PRC_tlkYD0x7LmhWY1GFdzm_kOEDgoZPqsI4XKL7YBSmgC0UGo0lRuBLgz-1rwtaWOJ3TJUyfrsiNo7rsgDlrrBShQmkjz0LJUCb2DRuPYhUrVA

Figure 4.  Localization divided into clusters  in Percentages .

 

 

 

https://lh5.googleusercontent.com/T6rgyCDO6lLIfhubTeOUWYi2mCvi9K4lsoMsri-V_TXhRSYIECanTh5b8fs50CnGbGvihHqhgKNkfUgFlrdxS1gsW6DOInOh96OB55OECYCqMET30vFx5qJG_OQGJVU9dw

Figure 4.  Localization divided into clusters  in Percentages .

 

 

Common habits

       Executive Employees frequent  alberts fine clothing and they do it during night time

       Facilities employees frequent abila airport and Stewart and sons fabrication in the afternoon

       Places like Alberts fine clothing  , brew’ve been served  and fridos auto supply ‘n more are the most frequented during night time.

       Those employees whose positions are not identified went to fridos auto supply ‘n more.

       The most frequented places for all employees are fridos auto supply ‘n more , alberts fine clothing , jacks magical bearns, hallowed ground , brew’ve been served y abila airport.

        Employees stay at gastech during the afternoon more than in the morning

 

 

 

MC2.2Identify up to twelve unusual events or patterns that you see in the data. If you identify more than twelve patterns during your analysis, focus your answer on the patterns you consider to be most important for further investigation to help find the missing staff members. For each pattern or event you identify, describe

a.       What is the pattern or event you observe?

                     Unusual credit card expenses.

b.      Who is involved?

Alcazar, Lucas has showed only one transactional expense of $10,000

 

c.       What locations are involved?

                      This movement is made at Frydos Autosupply n' More

d.      When does the pattern or event take place?

                      Day 13 at 7:36 am

e.      Why is this pattern or event significant?

Because there is no other employee spending such a high amount in a single transaction. His expenses are below $300.

f.        What is your level of confidence about this pattern or event?  Why?

 

Please limit your answer to no more than twelve images and 1500 words.

 

Figura 5. Dendogram Employees through distances.

 

 

Figure 6. Unusual Employees credit card expenses

 

 

 

Figure 7. Employees mobility in Guy Gyros 

 

 

Figure 8. Employees mobility in Katerinas cafe

 

 

 

Figure 9. Employees mobility in Brewre

 

 

Figure 10. Employees mobility in Hippokampos 

 

 

 

 

 

Figure 11. Credit card using separated by hours

 

 

Figure 12. Employees with the highest expenses

 

 

Figure 13.

 

 

Figure14. Employees that used all credit cards .

 

 

 

 

unusual behavior patterns:

 

       all employees with unidentified positions went to fridos auto supply ‘n more.

       most of facilities employees were at abila airport.

       employees of the engineering area at drill technician met at katerina’s café, day 7, at 20 hs.

       the employee cocinado, hideki has done the same activities as the employees that are not identified with ids.

       security employees used their fidelity credit cards in guy gyros at 0 hours.

       employees of unknown and security áreas used their fidelity credit cards at brewre, at 0 hours.

       most employees go often to hippokampos, saturdays 11 and 18, between 19 and 20 hours.

       all information technology, executive and security employees use their debit/credit/fidelity cards at katerina’s café, the previous two weeks, between 13 and 19 hours.

       all employees that used their credit cards, used their fidelity one.

       facilities employees have the highest expenses in credit cards.

       facilities employees assigned as truck drivers used all cards.

 

 

 

 

MC2.3Like most datasets, the data you were provided is imperfect, with possible issues such as missing data, conflicting data, data of varying resolutions, outliers, or other kinds of confusing data.  Considering MC2 data is primarily spatiotemporal, describe how you identified and addressed the uncertainties and conflicts inherent in this data to reach your conclusions in questions MC2.1 and MC2.2.  Please limit your response to no more than five images and 300 words.

 

https://lh3.googleusercontent.com/M_kTGivPRD6J3RNLF8jzLjjbkHotcVN8s2_SgSPBMbnSMAFHYEfUYvXC9G5xHUINUsikJwcw0oAl36Y7FJfEb87uBBIPXl_MIaZmIZ1IFU8W3IgNXWvfd_z0uRXk--9D4Q

Figure 15. Routes identified by GPS

 

 

https://lh4.googleusercontent.com/lXQN__kbH4sXxDX6k4XgeuVbuElZzJM-yXeUPYJcf-nEyYQHppCyxy_a69kZ97LadZR9lPKxEKeGk0nWEEqsIFj62zCtiLIr5gaozOkFOjzjGAqskIfxdoBAZN7CCiRHJA

Figure 16. Spatial Data with unknown ids, obtained by GPS

 

 

Figure 17. GPS tracks of unassigned ids in the car-asigment tables

 

 

https://lh4.googleusercontent.com/JwdUxUCx80WMbzGGzSG4lq7sLyDn8efxasUC_Ugvkh0Fkx1qxg7apaUN2V5oS-WR_Piwo1LLxUE20qLJpIlKkFFbgssQiox_kDFi_eHAPn1QQqsgshcTdLGkyGRmnfePdQ

Figure 18. Employees with card transactions and no car assigned

 

 

Figures Explanation:

 

·         The figure with gps data shows that points are very closed in time, measured in seconds. Sumarizing all this points to a minutes level of aggregation, data was reduced from 6851,69 points to 18,797 points, making much easier data analysis.

·         For Spatial GPS data, we find a group of unknown ids.

·         In the group of employees that have a different behavior pattern, jerarquic cluster was used in order to conclude what was expected, that this group is completely different to others.